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RNA VIRUS-DERIVED PLANT EXPRESSION SYSTEM 

FIELD OF THE INVENTION 

The present invention relates to the plant, plant parts or plant cell cultures having a 
heterologous DNA encoding an RNA replicon for expressing a sequence of interest. The 
invention also provides a process of expressing a sequence of interest in the plants, plant 
parts or plant cell cultures of the invention. The process and vectors provide the plant 
cells with an increased frequency of RNA virus-derived RNA replicon formation. Said 
heterologous DNA or part(s) thereof can be stably incorporated into the plant nuclear 
chromosomal or episomal DNA or transiently delivered as DNA molecule(s). The system 
that is based on stable incorporation of said precursor vector into plant chromosomal DNA 
is tightly controllable and allows for leakage-proof (zero expression level in non-induced 
state) control over the expression of gene of interest. 

BACKGROUND OF THE INVENTION 

Among plant transgene expression systems, expression of a transgene under the 
control of a heterologous promoter has been in use for several years. Apart from such 
conventional plant expression systems, virus-based expression systems can be used for 
rapid protein production in plants (for review see: Porta & Lomonossoff, 1996, MoL 
BiotechnoL, 5, 209-221; Yusibov et al., 1999, Curr.Top. Microbiol. Immunol., 240, 81-94) 
and are a powerful tool for functional genomics studies (Dalmay et al, 2000, Plant Cell, 
12. 369-379; Ratcliff et al., 2001, Plant J., 25, 237-245; Escobar et aL 9 2003, Plant Cell, 
15, 1507-1523). Numerous publications and patents in the field describe systems based 
on DNA and RNA viral vectors (Kumagai et al., 1994, Proc. Natl. Acad. Sci. USA, 90, 427- 
430; Mallory et al, 2002, Nature BiotechnoL 20^622-625; Mor ef al., 2003, BiotechnoL 
Bioeng., Ml 430-437; US5316931; US5589367; US5866785; US5491076; US5977438; 
US5981236; WO02088369; WO02097080; W09854342). The existing viral vector 
systems are usually restricted to a narrow host range in terms of their best performance 
and even the expression level of such vectors in their most favourable host is far below 
the upper biological limits of the system. An important issue of virus-based systems is the 
method of delivery of the viral replicon to a plant cell. The most broadly applied method of 
delivery for large-scale production (simultaneous production in many plants, e.g. in a farm 
field or a greenhouse) is the use of infectious copies of RNA viral vectors (Kumagai ef al., 
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1995, Proc. Natl. Acad. Sci. USA, 92, 1679-1683). Because of a relatively high tendency 
of recombinant viral RNA vectors to lose the heterologous inserts during the cycles of their 
replication, the method requires transcription of DNA templates in vitro, and as a result is 
inefficient and expensive. Another approach to solve the delivery problem could be the 
presence of a viral RNA replicon precursor in each cell of a transgenic plant, such that it 
can be released upon triggering the replication process by complementing a function of 
the viral vector (e.g. using helper virus - US5965794) or using other regulated switch 
systems (e.g. site-specific recombination - US6632980). 

Despite many publications in the field including patented technologies, there are 
still no large scale virus-based production systems that work with sufficient efficiency and 
yield for commercial high-yield production, predominantly due to two main reasons: 
Firstly, transient plant virus-based expression systems are generally restricted to specific 
hosts, which may not be suitable for large scale cultivation due to their susceptibility to 
environmental factors. Moreover, they are generally restricted to certain parts of a plant 
host, thus excluding most of the plant biomass from the production process and as a 
result minimizes the relative yield of recombinant product per unit of plant biomass down 
to a level comparable to that achievable by a conventional transcription promoter in a 
transgenic plant; 

Secondly, attempts to scale up the virus-based production system by generating 
transgenic plant hosts having the viral replicon precursor stably integrated in each cell did 
not provide a solution either, in particular because of underperformance of said replicons 
in such position, "leakiness" of the gene of interest to be expressed from said replicon and 
lack of an efficient switch system for said vectors. Certain progress was achieved with 
PVX-based vectors by using suppressors of PTGS silencing as trigger of RNA replicon 
formation (Mallory et al., 2002, Nature Biotechnol., 20, 622-625), but the system is still far 
below the practical value, as there is no solution provided for an efficient control of the 
switch (PTGS suppressor) triggering viral vector replication. However, this system 
provided for an expression level of the GUS gene reaching 3% of total soluble protein 
(TSP), which is the best known so far for this type of system, but still no better than a 
conventional transgene expression system under control of a strong promoter. Another 
inducible system based on a plant tripartite RNA virus (Mori et al., 2001, Plant J., 27, 79- 
86), Brome Mosaic Virus (BMV), gave a very low yield of the protein of interest (3-4 pg/g 
fresh weight), which is comparable with the yields provided by standard transcriptional 
promoters. The authors believe they can increase the yield of the protein of interest by 
optimising the codon usage of the gene encoding said protein. However, it is very unlikely 
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that such optimisation can lead to an expression level higher than that obtainable with 
strong constitutive promoters in transgenic plants. 

The low expression levels achieved so far with plant expression systems are a 
major reason why these systems are hardly competitive with other expression systems 
like bacterial, fungal, or insect cell expression systems. Low expression levels give rise to 
very high downstream costs for protein isolation and purification in a huge background of 
plant material. Therefore, costs for downstream processing quickly decrease, as the yield 
of the protein or product of interest per unit plant biomass increases. 

There is presently no large-scale plant transgene expression system the yield and 
efficiency of which would be high sufficient to compete on the market with other large- 
scale expression systems like bacterial, fungal, or insect cell expression systems. Such a 
plant expression would have to fulfill the following criteria as good as possible: 

(i) high yield, including expression of the protein of interest in as many plant tissues 
as possible and in as many cells of said tissues; 

(ii) for preventing a deleterious effect of protein expression on plant growth, 
expression of the protein or product of interest should be switchable such that 
expression can be switched on at a desired point in time. 

(iii) the switching should be such that expression can be switched on simultaneously 
or nearly simultaneously in all tissues or cells of a plant and, at the same time, in 
all plants of a selected group of plants, e.g. in all plants of a selected lot of plants. 
Typically, the protein or product of interest accumulates in each cell producing said 
product or protein up to a certain point. During accumulation, however, degradative 
processes frequently set on that tend to reduce the yield or quality of the protein or 
product of interest. Therefore, there is an optimal point in time after switching on 
expression, where the product or protein of interest should be harvested. This 
optimal point in time should be reached in all tissues or cells of a plant and in all 
plants of a selected lot at the same time to make the overall process efficient and 
profitable. 

Therefore, it is an object of this invention to provide a transgenic plant, plant part, 
or plant cell culture for a high-yield plant expression system. 

It is another object of the invention to provide an efficient process of expressing 
one or more sequences of interest in a plant, plant part of plant cell culture, whereby said 
process can be used efficiently on large scale. 



4 

Further, it is an object of the invention to provide a method of controlling the 
expression of nucleic acid sequence(s) of interest in a plant, plant part, or plant cell 
culture, which is of improved ecological and biological safety as it allows to have a zero 
expression level of said nucleic acid sequences of interest in the inactive state. 

GENERAL DESCRIPTION OF THE INVENTION 

The above objects are achieved by a stably transformed plant, plant part, or plant 
cell culture containing in cell nuclei a heterologous DNA having a sequence encoding an 
RNA replicon operably linked or linkable to a transcription promoter, 
wherein said sequence encoding an RNA replicon contains 

(i) sequences for replicon function of said RNA replicon, said sequences being 
derived from a sequence of a plant RNA virus, and 

(ii) a sequence of interest to be expressed from said RNA replicon, 

whereby said sequences for replicon function correspond to sequences of said plant RNA 
virus and exhibit at selected localities of said sequence of said plant RNA virus function- 
conservative differences from said sequence of said plant RNA virus, said differences 
causing an increased frequency of replicon formation compared to an RNA replicon not 
exhibiting said differences. 

Further, the above objects are achieved by a process of expressing a sequence of 
interest in a plant, plant part, or plant cell culture, comprising: 

(a) providing a plant, plant part, or plant cell culture containing in cell nuclei a 
heterologous DNA as defined above and 

(b) causing expression of said sequence of interest. 

The invention further provides a nucleic acid molecule for producing said plant, 
plant part, or plant cell culture of the invention or for carrying out the process of the 
invention. Said nucleic acid molecule is as defined in the claims and as further defined as 
described below with reference to the plant, plant part of plant cell culture of the invention. 

When the inventors of the present invention introduced a heterologous DNA 
encoding RNA-viral replicons in nuclear chromosomes of plants or plant parts for 
expressing a protein of interest encoded in said RNA replicon, they found that the 
frequency with which RNA replicons appeared in the cytosol "was very low and occurred 
only in a small fraction of cells containing the heterologous DNA. Accordingly, the 
expression level of the protein of interest was also very low. Many possible reasons for 
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this problem were considered including positional effects of the chromosome, unsuitable 
transcription regulatory elements, gene silencing, defective transport of the replicon from 
the nucleus to the cytosol. a deleterious effect of the sequence of interest on transcription, 
RNA processing or replicon function etc. It was then surprisingly found that certain A/T(U)- 
rich sequence portions in the replicon were responsible for the low frequency of replicon 
formation in the cytosol. When the deleterious effect of said A/T(U)-rich sequence portions 
was suppressed, the frequency of replicon formation in the cytosol strongly increased, 
resulting in a strongly increased yield of the protein of interest. 

The efficiency of the present invention is such that a new dimension in plant 
expression systems is attained. The expression levels achievable with the present 
invention are such that expenditures for downstream processing (including separation and 
purification of the protein of interest) are low enough to make the process of the invention 
competitive with other large-scale expression systems. In prior art expression systems 
using stably transformed plants, the expression level is low even if virus-based vectors are 
used, since replicons are produced in a small fraction of the cells. Replicons that spread in 
the plant cannot remedy this problem, as spreading is slow, notably over long distances. 
Therefore, expression does not proceed uniformly in the plant and degradation of the 
protein of interest will already take place in some parts of the plant while in others protein 
expression has not even started. The invention allows to trigger expression uniformly 
throughout the plant. The small fraction of cells that do not produce a replicon can be 
quickly infected by replicons from neighbouring cells. The invention provides the first high- 
yield plant expression system that can be used on large scale. The invention even allows 
to produce two or more replicons in the same cell, whereby probability of having both 
replicons in the same cells is still very high. Further, the efficiency of the expression 
system of the invention is such that the otherwise limited plant specificity of RNA viruses 
is reduced. 

The stably transformed plant, plant part, or plant cell culture and said nucleic acid 
molecule have a heterologous DNA encoding an RNA replicon. Said sequence encoding 
an RNA replicon contains 

(i) sequences for replicon function of said RNA replicon, said sequences being 
derived from a sequence of a plant RNA virus, and 

(ii) a sequence of interest to be expressed from said RNA replicon. 

Said sequences for replicon function contain a sequence of interest to be 
expressed from said RNA replicon (ii). Said sequence of interest to be expressed may 
lead to formation of an RNA of interest, like an RNA for RNA interference for suppressing 
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a function of said plant. Preferably, however, said sequence of interest codes for a protein 
of interest and contains regulatory sequences for translating said protein of interest from 
said RNA replicon or from subgenomic RNA or said RNA replicon. 

Said sequences for replicon function of said RNA replicon (i) correspond to 
sequences of said plant RNA virus inter alia in that the former are a DNA copy of the 
latter. Said sequences for replicon function provide the RNA replicon with the function to 
replicate in the cytosol. Said sequences for replicon function typically code for one or 
more proteins involved in replication like an RNA-dependent RNA polymerase (replicase). 
Said sequences for replicon function may further code for functions of an RNA replicon 
like one or more proteins involved in cell-to-cell or systemic spreading of an RNA virus in 
a plant like a movement protein or a coat protein. Said sequences for replicon function are 
preferably derived from a sequence of a plant RNA virus, since plant RNA viruses are an 
easily accessible source for replicon functions. "Being derived" means that said 
sequences for replicon function are a DNA copy of the corresponding sequences of said 
RNA virus and said DNA copy makes up a part of said heterologous DNA contained in cell 
nuclei. "Being derived" further means that said sequences for replicon function are not an 
exact DNA copy of the corresponding RNA sequence of said RNA virus, but exhibit 
function-conservative differences as described below. Since said differences are function- 
conservative, said sequences for replicon function preferably code for proteins capable of 
carrying out replicon functions similarly as they do for said RNA virus. 

Said sequences for replicon function exhibit at selected localities of said sequence 
of said plant RNA virus function-conservative differences relative to said sequence of said 
plant RNA virus, said differences causing an increased frequency of replicon formation 
compared to an RNA replicon not exhibiting said differences. Said differences are causal 
for said increased frequency of replicon formation in plant cells, once the overall process 
has been switched on (see below). The causal connection between the increased 
frequency of replicon formation and said differences can be tested experimentally by 
comparing the frequency of replicon formation between sequences for replicon function 
having said differences and sequences for replicon function not having said differences. 
Such an experimental comparison can be made e.g. by counting protoplasts expressing 
said sequence of interest as described in the examples. Preferably, a sequence of interest 
coding for an easily detectable reporter protein like green fluorescent protein (GFP) is 
used for this purpose. As further described below, it is also preferred to perform the 
experimental comparison with RNA replicons not capable of cell-to-cell spreading. 

Said function-conservative differences are introduced into said sequences for 
replicon function at selected localities of said sequence of said plant RNA virus. Said 
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selected localities are localities in sequences for replicon function of said plant RNA virus 
that are responsible for a low probability of an RNA replicon transcribed in the nucleus to 
appear in the cytosol as a functional replicon. Preferably, such selected localities have a 
high A/T(U)-content, i.e. a high A-content and/or a high T-content (a high U-content on 
RNA level), or have cryptic splicing sites, i.e. sequence portions that are recognized by 
the nuclear splicing machinery as splicing sites. Said selected localities may be identified 
in an RNA virus on which an RNA replicon is based by analyzing the RNA profile of the 
RNA virus as exemplified below. Further, selected localities may be identified 
experimentally by analyzing the RNA formed in a plant cell after transformation with a 
heterologous DNA encoding an RNA replicon that does not exhibit the differences 
according to the invention. This experimental analysis may be done by RT-PCR, 
preferably together with sequencing of the RT-PCR products. In this way, undesired 
splicing products that indicate splicing events destroying the RNA replicon may be 
identified. Further, the exact sites of undesired splicing may be identified and then 
remedied by introducing said function-conservative differences at said selected localities. 

Said function-conservative differences cause an increased frequency of RNA 
replicon formation by suppressing the deleterious effect of said selected localities on said 
frequency of RNA replicon formation. Said function-conservative differences may 
comprise a reduction of a high A/U-content in said RNA replicon by reducing a high A/T 
content in said sequences for replicon function of said sequence encoding said RNA 
replicon. Said high A/U content may be reduced by at least partial deletion or at least 
partial replacement by G/C bases (e.g. using the degeneracy of the genetic code), 
provided said differences are function-conservative. Further, cryptic splicing sites flanking 
A/U-rich regions of said sequences derived from a plant RNA virus may be removed. 
Such function-conserved differences may be introduced at one or at, preferably, several 
selected localities. 

Preferred function-conservative differences comprise the insertion of one or more 
introns, most preferably nuclear introns, or one or more sequences capable of forming 
nuclear introns near or within A/U-rich localities of said sequences being derived from 
sequences of said plant RNA virus. It has surprisingly been found that the introduction of 
introns at or near A/U-rich localities results in an increased frequency of RNA replicon 
formation. Several introns may be introduced and examples are given herein for various 
numbers of introduced introns. The effects of more than one intron are cumulative. 
Further, intron insertion may be combined with other function-conservative differences at 
other selected localities. 
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Fig. 8 shows an example for the introduction of sequences capable of forming a 
nuclear intron, albeit in the gene of interest to be expressed. In the example of Fig. 8, the 
intron is formed from two intron halfs upon recombinase-catalyzed flipping of a part of said 
heterologous DNA. This principle may also be applied to sequences for replicon function 
of said RNA replicon. In an embodiment wherein two different RNA replicons are formed 
in the same cell, recombination between said two different replicons may result in the 
formation of an intron from two intron halfs present on different replicons. Further, an RNA 
replicon may be formed by recombination between two precursors, neither of which is a 
replicon. Also in this case, an intron may be assembled from two intron halfs derived from 
different precursor molecules. 

Said plant, said plant part, or said plant cell culture of the invention are stably 
transformed with said heterologous DNA. Stably transformed means that they contain said 
heterologous DNA in cell nuclei such that said heterologous DNA is maintained in said cell 
nuclei as nuclear chromosomes are maintained. Said heterologous DNA may be 
contained in or may be an episomal element. Preferably, however, said heterologous DNA 
is stably incorporated in a nuclear chromosome such that it can be inherited to progeny 
cells or progeny plants. Methods of producing plants, plant parts, or plant cell cultures that 
are stably transformed are known in the art of plant biotechnology. 

Preferably, all cells of said plant, said plant part or said cell culture contain in their 
nuclei said heterologous DNA for giving a high yield of a protein of interest. More 
preferably, said cells contain said heterologous DNA stably integrated in a nuclear 
chromosome. In the case of a plant, this means that the plant is a transgenic plant. 

Said heterologous DNA having said sequence encoding said RNA replicon is 
operably linked or linkable to a transcription promoter. If said heterologous DNA is 
operably linked to said transcription promoter, the transcription promoter is preferably a 
regulated promoter, like an inducible, tissue-specific or developmentally regulated 
promoter in order to make expression of said sequence of interest regulatable. More 
preferably, said promoter is inducible or developmentally regulated, which allows that 
expression is induced at a desired time or that expression is switched on when the plant 
reaches a defined developmental stage, respectively. For example, expression of said 
sequence of interest can be switched on in plant seeds if said promoter is a seed-specific 
promoter. The invention is of high value for providing specific tissues (e.g. seed tissues) 
with RNA replicons. whereby said tissues may be less suited for cell-to-cell spread of said 
replicon than e.g. leaf tissue. Most preferred are chemically regulated promoters, since 
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they allow to switch on expression at will in all or in most tissues of a plant. Most 
importantly, chemically inducible promoters are the promoters of choice for large scale 
applications, since the chemical inducer can be applied to a large number of plants at the 
same time. Examples of regulated promoters are known in the art. 

Said sequence encoding an RNA replicon may alternatively be operably linkable to 
a transcription promoter, which allows to switch on expression of said sequence of interest 
by operably linking said sequence encoding an RNA replicon with a promoter. There are 
several ways of reducing this embodiment to practice. One option is to separate, in said 
heterologous DNA, said sequence encoding an RNA replicon and said promoter by a 
sequence block that precludes an operable linkage between said promoter and said 
heterologous DNA. Said sequence block may be flanked by recombination sites such that 
said block can be cut out by a recombinase recognizing said recombination sites. 
Thereby, operable linkage for transcription of said sequence encoding an RNA replicon 
can be established and expression may be switched on. Another option is to have a 
portion of a sequence necessary for transcription (e.g. a promoter or promoter portion) in 
flipped orientation and flanked by recombination sites. Providing a suitable recombinase 
may flip said sequence portion back in correct orientation, whereby an operable linkage 
can be established. 

In one embodiment of the invention, said sequence encoding an RNA replicon has 
one or more segments that code together for said RNA replicon, i.e. said RNA replicon is 
not encoded by one continuous DNA. Instead, said RNA replicon is encoded 
discontinuously by two or more segments, whereby said segments may be present on the 
same chromosome preferably contiguous to each other. Formation of said RNA replicon 
may then require rearrangement of said segments, e.g. by recombination. As an example, 
a part of a sequence for replicon function (e.g. a part of a sequence coding for a replicase) 
may be present in said heterologous DNA in a flipped orientation relative to other parts of 
such a sequence. The flipped part may be flanked by recombination sites. Then the 
transcript of the heterologous DNA will not be a replicon. since said replicon function 
cannot be provided (e.g. because the transcript does not code for a functional replicase). 
Providing a site-specific recombinase recognizing the recombination sites can flip one of 
said segments back such that a replicon function is encoded continuously. In this 
embodiment, providing the recombinase may function as a switch for switching on 
replicon formation and expression of a sequence of interest (see further below). 

Alternatively, said segments may be present on different chromosomes. Formation 
of an RNA replicon will then require transcription of both segments and trans-splicing of 
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both transcripts for assembling said RNA replicon. This embodiment may be used for 
quickly segregating the segments that encode together said RNA replicon in progeny 
plants or cells as described in detail in PCT/EP03/02986. 

The process of the invention comprises said steps (a) and (b). Step (a) may 
comprises stable or transient transformation of a plant, plant part or plant cell culture with 
said nucleic acid of the invention. As discussed above, stable transformation of a nuclear 
chromosome is preferred. Preferably, the process of the invention is a process of 
expressing a protein of interest encoded by said sequence of interest. Step (b) comprises 
causing expression of said sequence of interest, e.g. switching on said expressing. 
Various methods of causing or switching on expression have already been mentioned. 
Examples include inducing an inducible promoter operably linked to said heterologous 
DNA; bringing said heterologous DNA under operable linkage to a promoter using 
recombination; establishing continuous coding of a sequence for replicon formation using 
recombination etc. If a recombinase is used for switching on the process of the invention, 
said recombinase may be provided to said plant, plant part or plant cell culture transiently, 
whereby said providing would act as a switch for step (b). Alternatively, said recombinase 
may be stably encoded in cells, and expressing of the recombinase under control of a 
regulated, preferably inducible, promoter. Inducing recombinase expression by inducing 
said promoter may then cause expression in step (b). 

Preferably, the process of the invention is performed with many plants in parallel 
by providing many plants according to (a) and causing expression of said sequence of 
interest according to (b) with all plants in one step, e.g. by applying a chemical inducer for 
a chemically inducible promoter to all plants for example by spraying. 

The present invention may in principal be applied to any plants for which infectious 
RNA viruses exist. Suitable plant/RNA virus pairs may be derived from the list of RNA 
viruses given below. Due to the very high efficiency of replicon formation according to the 
invention, the plant species specificity of plant viruses is far less pronounced when this 
invention is practiced. Similarly, the present invention may be used with RNA replicons 
based on any RNA virus. RNA viruses have generally evolved outside the cell nuclei of 
their host plants and will have selected localities that make a replicon based on such a 
virus inefficient when the replicon is produced inside cell nuclei, notably if the replicon is 
stably encoded in a nuclear chromosome. The invention can be applied to all RNA 
viruses, although the level of improvement may vary between different plant RNA viruses. 
The most preferred plant RNA viruses the invention may be based on are tobamoviruses, 
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notably tobacco mosaic virus, and Potexviruses such as potato virus X. In the case of 
tobacco mosaic virus, it will generally be the coat protein that is replaced by said 
sequence to be expressed. The movment protein may be removed or replaced by a 
sequence to be expressed. Preferably, however, an RNA replicon derived from tobacco 
mosaic virus should code for the movement protein and have the coat protein be replaced 
by said sequence to be expressed. 

The major application of the present invention is the production of a protein of 
interest in plants, plant parts or plant cell cultures. If the process of the invention is 
performed in plants, plants that do not enter the human or animal food chain are 
preferred, like Nicotians species. Plant parts are e.g. plants organs or specific tissues of 
plants like leaves or seeds. Herein, seeds are considered as plant parts if the process of 
invention is done in seeds growing or being attached to a parent plant Seeds are, 
however, also considered to be plants, albeit in a certain developmental stage of a plant. 
Most preferably, the plants of the invention are sold or distributed as seeds, the seeds are 
grown to plants, and expression of said sequence of interest is induced or switched on at 
a desired point in said plants. 

BRIEF DESCRIPTION OF THE FIGURES 

Fig. 1 depicts the general principle of the invention, based on increased frequency of 
RNA virus-based replicon formation. 

Fig. 2 shows the intron prediction profile of transcribed region of vector plCH8543. 
Nucleotide numbers are given on the horizontal axis. The vertical axis shows the 
probability for corresponding sequence/sequence region to be a coding sequence 
(coding), to serve as donor site (Donor) or as acceptor site (Acceptor). Circled parts 
correspond to selected localities where said function conservative differences should be 
introduced. 

Fig. 3 shows the intron prediction profile of transcribed region of vector plCH15466, first 
half of the transcribed region. The circled regions were modified (compare Fig. 2A) as 
described in this invention. 

Fig. 4 shows the intron prediction profile of transcribed region of plCH1590, second half of 
said region. The circled regions were modified (compare with Fig. 2B) as described in this 
invention. 

Fig. 5 shows the intron prediction profile of transcribed region of plCH15499. The circled 
regions correspond to six inserted plant nuclear introns. 
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Fig. 6 depicts schematic representations of viral constructs vectors unmodified and 
modified according to the scope of said invention. 

Fig. 7 shows GFP expression after agroinfiltration of the viral constructs in Nicotiana 
benthamiana and Nicotiana tabacum leaves. The vector identification number for each 
infiltrated area is indicated. 

A - Nicotiana benthamiana, 8 days after agroinfiltration; 
B - Nicotiana tabacum, 8 days after agroinfiltration; 

C - Nicothiana benthamiana protoplasts isolated 5 days after agroinfiltration. Many light 
spots in the right picture indicate an extremely high frequency of replicon formation and 
GFP expression. 

Fig. 8 depicts the schematic presentation of RNA virus-based replicon precursor designed 
according to the present invention, which gives zero expression level of the gene of 
interest (GFP, indicated by G) in the non-induced state. 

P - transcription promoter, T - transcription termination region; SM - selectable marker 
gene; Ac2 - promoter of Arabidopsis ACTIN2 gene; RdRP viral RNA-dependent RNA 
polymerase; MP - viral movement protein; NTR - viral 3' non-translated region. 
Fig. 9 shows the intron prediction profile for Arabidopsis thaliana meiosis-specific gene 
AtDMCI (GenBank Acc.No U76670), using the direct strand (+ strand). The intron-coding 
regions are circled. 

Fig. 10 (A, B) shows the prediction of potential problematic regions (circled) within the 
direct strand (+ strand) of Potato Virus X (PVX) genome (GenBank Acc. No. AF172259). 
Fig. 11 shows in A, B, and C the prediction of potential problematic regions (circled) of the 
direct strand (+ strand) of alfalfa mosaic virus genomes of RNA1 (GenBank Acc. No 
K02703), RNA2 (GenBank Acc. No K02702) and RNA3 (GenBank Acc. No L00163), 
respectively. 
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DETAILED DESCRIPTION OF THE INVENTION 

We have surprisingly found that the incorporation of plant introns into certain 
regions of plant viral RNA vectors and removal or replacement of cryptic introns within 
sequences for replicon function can dramatically increase (at least x10 2 folds) the 
efficiency of said replicons in host plants. Such increase in efficiency was reflected in at 
least one easily measurable parameter relative proportion of cells showing replication of 
said vector, e.g. in increased frequency of replicon formation. Such optimisation of 
initiation of RNA replicon formation led to the ability of synchronized switching on of 
expression of a sequence of interest in a whole plant, resulting in dramatically increased 
yield of recombinant protein of interest in shorter time than for a non-modified vector. 

Despite of publications concerning the increase of nuclear transgene expression 
by incorporation of introns in coding regions of recombinant DNA (Mascarenhas et ai, 
1990, Plant Mol. Biol., 15, 913-920; Bourdon et aL, 2001, EMBO Reports, 2, 394-398; 
Rose, AB., 2002, RNA, 8, 1444-1453; US5,955,330), there is no hint showing that 
incorporation of introns into viral RNA replicons would have any positive effect on the 
frequency of viral replicon formation and subsequently, on the level of expression of a 
sequence of interest provided by said replicon. This effect is surprising considering that 
nuclear mRNA transcription and viral RNA replication take place in different sub-cellular 
compartments. Even if the cDNA copy of a viral replicon is placed in the nucleus, only the 
first copy of the viral replicon precursor is produced in nucleus and then amplified in the 
cytoplasm under conditions different from those in the nucleus. In the prior art, the use of 
introns for preventing the cytotoxic effect of "leaky" expression of viral genes in E.coli 
during cloning with wild type virus cDNAs was described (Johansen, I.E. 1996, Proc. Natl. 
Acad. ScL USA, 93, 12400-12405; Yang et aL, 1998, Arch. Virol., 143, 2443-2451; Lopez- 
Moya & Garcia, 2000, Virus Res., 68. 99-107). There is no hint that intron inclusion can 
increase the frequency of replicon formation from a viral cDNA clone. For example, there 
was no difference in infectivity between intron-less, intron-containing cDNA and wild type 
lettuce mosaic virus (LMV) (Yang et al., 1998, Arch. Virol., 143, 2443-2451). In addition, 
the intron-containing LMV cDNA showed delayed establishment of infection. No difference 
in infectivity between intron-less and intron-containing cDNAs was found in the case of 
plum pox potyvirus (PPV) (Lopez-Moya & Garcia, 2000, Virus Res., 68, 99-107). The 
publication of Johansen (1996, Proc. Natl. Acad. Sci. USA, 93, 12400-12405) contains 
non valid comparative data concerning this matter. In addition, the results obtained for 
wild type RNA viruses and their cDNA copies cannot be compared with virusnderived 
expression vectors designed for the expression of a sequence of interest in plants, 
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predominantly at the expense of other properties of wild type viruses like high infectivity 

and stability of said viruses. 

The present invention provides a method for increasing fundamentally the 
frequency of RNA virus-derived replicon formation, said replicons are derived upon 
transcription of DNA precursor and designed for the expression of sequences of interest. 
Said method overcomes the limitations of existing viral vector-based expression systems, 
such as size limitation for heterologous sequences to be expressed and high instability of 
said vectors. Further, said method offers better biosafety characteristics, allows to design 
leakage-proof control over the transgene expression (zero expression level in non- 
induced state), as such design can be an integrated part of the strategy for said RNA 
virus-derived replicon design. By providing high frequency of RNA virus-derived replicon 
formation, the approach described herein allows for a rapid initiation of the expression of 
a sequence of interest in a whole plant, part of plant or plant cell culture containing in cell 
nuclei a heterologous DNA encoding said RNA replicon. By practicing the invention, the 
performance of practically any plant RNA virus-derived replicon designed for the 
expression of a heterologous sequence of interest can be improved significantly through 
dramatic increase of the frequency of replicon formation. 

RNA viruses belonging to different taxonomic groups are suitable for constructing 
RNA replicons according to this invention. A list of RNA viruses to which this invention can 
be applied.is presented below. Taxa names in quotes (and not in italic script) indicate that 
this taxon does not have an ICTV international approved name. Species (vernacular) 
names are given in regular script. Viruses with no formal assignment to genus or family 
are indicated): 

RNA Viruses: 

ssRNA Viruses: Family: Bromoviridae, Genus: Alfamovirus, Type species: alfalfa 
mosaic virus, Genus: llarvirus. Type species: tobacco streak virus, Genus: Bromovirus, 
Type species: brome mosaic virus, Genus: Cucumovirus, Type species: cucumber 
mosaic virus; 

Family: Closteroviridae, Genus: Closterovirus, Type species: beet yellows 
virus, Genus: CrinMrus, Type species: Lettuce infectious yellows virus, Family: 
Comoviridae, Genus: Comovirus, Type species: cowpea mosaic virus, Genus: 
Fabavirus, Type species: broad bean wilt virus 1. Genus: Nepovirus, Type species: 
tobacco ringspot virus; 
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Family: Potyviridae, Genus: Potyvirus, Type species: potato virus Y, Genus: 
Rymovirus, Type species: ryegrass mosaic virus, Genus: Bymovirus, Type species: 

barley yellow mosaic virus; 

Family: Sequiviridae, Genus: Sequivirus, Type species: parsnip yellow fleck 
virus, Genus: Waikavirus, Type species: rice tungro spherical virus; Family: 
Tombusviridae, Genus: Carmovirus, Type species: carnation mottle virus, Genus: 
Dianthovirus, Type species: carnation ringspot virus, Genus: Machlomovirus, Type 
species: maize chlorotic mottle virus, Genus: Necrovirus, Type species: tobacco 
necrosis virus, Genus: Tombusvirus, Type species: tomato bushy stunt virus, 
Unassigned Genera of ssRNA viruses, Genus: Ca P illovirus,Type species: apple stem 
grooving virus; 

Genus: Carlavirus, Type species: carnation latent virus; Genus: Enamovirus, 
Type species: pea enation mosaic virus, 

Genus: Furovirus, Type species: soil-borne wheat mosaic virus, Genus: 
Hordeivirus, Type species: barley stripe mosaic virus, Genus: Idaeovirus, Type 
species: raspberry bushy dwarf virus; 

Genus: Luteovirus, Type species: barley yellow dwarf virus; Genus: Marafivirus, 
Type species: maize rayado fino virus; Genus: Potexvirus, Type species: potato virus 
X;Genus: Sobemovirus, Type species: Southern bean mosaic virus, Genus: Tenuivirus, 
Type species: rice stripe virus, 

Genus: Tobamovirus, Type species: tobacco mosaic virus, 

Genus: Tobravirus, Type species: tobacco rattle virus, 

Genus: Trichovirus, Type species: apple chlorotic leaf spot virus; Genus: 
Tymovirus, Type species: turnip yellow mosaic virus; Genus: Umbravirus, Type 
species: carrot mottle virus; Negative ssRNA Viruses: Order: Mononegavirales, 
Family: Rhabdoviridae, Genus: Cytorhabdovirus, Type Species: lettuce necrotic 
yellows virus, Genus: Nucleorhabdovirus, Type species: potato yellow dwarf virus; 

Negative ssRNA Viruses: Family: Bunyaviridae, Genus: Tospovirus, Type 
species: tomato spotted wilt virus; 

dsRNA Viruses: Family: Partitiviridae, Genus: Alphacryptovirus, Type species: 
white clover cryptic virus 1, Genus: Betacryptovirus, Type species: white clover cryptic 
virus 2, Family: Reoviridae, Genus: Fijivirus, Type species: Fiji disease virus, Genus: 
Phytoreovirus, Type species: wound tumor virus, Genus: Oryzavirusjype species: 
rice ragged stunt virus; 

Unassigned Viruses: 
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Genome: ssRNA, Species Garlic viruses A.B.C.D. Species grapevine fleck 
virus, Species maize white line mosaic virus, Species olive latent virus 2, Species: 
ourmia melon virus, Species Pelargonium zonate spot virus. 

The general principle of the invention is shown in Figure 1. It is known that plant 
RNA viruses (an exception are viroids - small non-coding RNAs amplifying in plant cell 
nuclei - for a review see Diener, T.O.. 1999, Arch. Virol. Suppl., 15, 203-220; Flores. R., 
2001, CR Acad. Sci. Ill, 324, 943-952) never occur in the plant nucleus, but in the 
cytoplasm. Therefore, the sequences of RNA viruses might not be adapted to withstand 
nuclear RNA processing events due to the presence of motifs that might be involved in 
complex series of processing steps including transport of processed RNA in cytoplasm, in 
which pre-mRNAs, rRNA and tRNA precursors are involved. The processing events, such 
as 5' end capping, splicing, 3* end generation, polyadenylation, degradation, base and 
sugar modification as well as editing (in plastids and mitochondria) are intensively studied. 
However, many elements of such events still remain unclear. The most dramatic changes 
to pre-mRNA in the nucleus happen during pre-mRNA splicing, the process by which 
intervening RNA sequences (introns) are removed from the initial transcript and exons are 
concomitantly ligated. Splicing is mediated by the splicesome, a complex structure 
comprising uridilate-rich small nuclear ribonucleoprotein particles. The splicesome carries 
out the splicing reaction in two consecutive steps: the first one - cleavage at the 5' splice 
site of upstream exon/intron junction leading to lariat formation, and second step - 
cleavage at the 3* splice site of intron/downstream exon junction followed by upstream 
and downstream exons ligation (for review see: Kramer, A., 1996, Annu.Rew. Biochem., 
65, 367-409; Simpson.GG. & Filipowicz, W. 1996, Plant Mol. Biol., 32, 1-41). The 5' and 
3' splice site dinucleotides (57GU; AG/3') flanking the intron sequences are highly 
conserved in higher plants and single G replacement might abandon the splicing activity 
at the site concerned. Surprisingly, but despite high conservation of splice sites between 
plants and animals, heterologous introns in plants are usually not spliced or spliced 
incorrectly (van Santen, VL. et al : , 1987, Gene, 56^ 253-265; Wiebauer, K., Herrero, J. J., 
Filipowicz, W. 1988, Mol. Cel. Biol., & 2042-2051). Considering that plant viral RNAs were 
not under evolutionary pressure to resist nuclear RNA processing machinery, these RNAs 
are very likely to become subject of such processing, including splicing, once they are 
placed into the nuclear environment. This situation is completely different from that of 
RNA transcripts encoded by nuclear genes, as the latter transcripts are evolutionary 
adapted to preserve their functionality, despite of series of RNA modifications taking place 
in the nucleus. However, such modifications can have dramatic consequences for viral 
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RNA replicon formation. Re-engineering of the plant virus in order to make expression 
vectors for heterologous genes might further add to the instability of RNA virus-based 
replicons, as it would add further elements that might interact with RNA sequences of viral 
origin, producing defective RNA that is unable to replicate. Our invention addresses these 
problems by subjecting the expression vector to modifications that significantly increase 
the frequency of functional RNA replicon formation, when the expression vector is 
introduced as a DNA precursor into plants or plant cells to provide for transient expression 
or for stable integration into plant chromosomal DNA. We believe that modifications of 
virus-derived sequences shall be the most profound solution for increasing the efficiency 
of RNA virus-based replicons. The modifications might be direct (e.g. within said RNA 
virus-derived sequences) or indirect (e.g. within the sequences of non-viral origin), but 
said indirect modifications may have a stabilizing effect on the sequences of viral origin. In 
this invention we predominantly focus on modifications within the RNA virus derived 
sequences, as they are crucial for increasing the efficiency of RNA replicon formation. 

Surprisingly, our first attempt to find evidence that potentially problematic regions 
do exist, was successful and even more surprisingly, we obtained experimental 
confirmation by finding unexpectedly an improvement of orders of magnitude. An analysis 
of the sequence derived from the RNA virus of expression vector plCH8543 (EXAMPLE 1, 
Fig. 6A) using the Netgenell server program (http://www.cbs.dtu.dk/services/NetGene2/) 
for the presence of cryptic introns and RNA splicing sites showed the presence of intron- 
like regions that might be spliced by the nuclear RNA processing machinery (see circled 
regions in Fig. 2). There are many other programs that can be used to identify potentially 
problematic regions (said selected localities) within plant viral RNA sequences, such as 
exon/intron prediction program (http://genes.mit.edu/GENSCAN.html) or splicing signal 
prediction program (http://125.itba.mi.cnr.it/-webgene/wwwspliceview.html) for variety of 
organisms. 

Considering that all existing programs are not ideal and are subject to mistakes, 
the potential problematic regions can also be determined experimentally. This can be 
done by analyzing the transcripts derived from DNA vector under test in an nuclear 
environment with the help of such a routine technique as RT-PCR (Frohman, MA, 1989, 
Methods Enzymol., 218, 340-356) or its more advanced version suitable for precise 
quantification of the concentration of different transcripts called real-time PCR (Gibson et 
al., 1996, Genome Res., 6, 995-1001), preferably followed by sequencing of the PCR- 
amplitled products, the function-conservative differences of the invention change 
dramatically the RNA profile, for example by replacing intron-like sequences with exon-like 
ones, e.g. by introducing silent mutations with replacement of A/U-rich regions (intron-like) 
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with G/C-rich regions (exon-like) (see Figure 3, circled regions). Plant introns, unlike 
exons, are usually A/U rich (Csank, C. et ai, 1990, Nucl. Acid Res., 18, 5133- 
5141;Goodall & Filipowicz, 1989, Cell, 58, 473-483), but there are exceptions, for example 
when in monocotyledonous plants G/C rich introns were found (Goodall & Filipowicz, 
1989, Cell, 58, 473-483; Goodall & Filipowicz, 1991, EMBO J., 10, 2635-2644). For 
practicing this invention, the A/U rich regions include not only prolonged stretches of the 
sequences of at least 20 nucleotides in length with at least 55% or more of A/U content, 
but also shorter stretches ("islands") of 6-19 nucleotides in a row of purely A/U-containing 
sequences. For the avoiding any misunderstanding, the A/U content within this invention 
means that all sequences which are more A- than U-rich, or only A-rich and vice-versa are 
covered by this definition. In EXAMPLE 2 we demonstrate that the modification of A/U rich 
region increases the number of GFP expressing cells at least 10-folds. This is clearly 
demonstrated in Figure 7 by comparing the areas agroinfiltrated with plCH15466 
(modified vector, Fig 6A) and plCH14833 (control vector, Fig. 6A). Removing the 
movement protein (MP) allows for an accurate count of primary cells possessing 
functional RNA replicons, as the cell-to-cell movement from the site of primary infection to 
neighbouring cells does not take place. In EXAMPLE 3, the modification of another U-rich 
intron-like region containing many cryptic splice sites (Fig.2B) and covering the 
subgenomic promoter of the movement protein (MP) was performed (Fig. 4, circled). This 
modification gave a dramatic effect on the increase of the frequency of replicon formation 
from viral vector plCH1590. As it was established in protoplasts counting experiments 
(EXAMPLE 3), the increase was approximately 100-fold in comparison with unmodified 
vector plCH14833 for both tested Nicotiana species - N. benthamiana and N. tobacco 
(see the corresponding infiltrated areas in Figure 7, A, B). In general, by using the 
approaches described in this invention, we could increase the frequency of RNA replicon 
formation approx. 300-fold, i.e. increasing the proportion of cells with functional replicons 
from about 0.2% (control vector) to more than 50% (modified vector). We believe this is 
not the limit and reaching a frequency of 100% is very realistic. 

Such a high efficiency of replicon formation opens the door for expressing two or 
more different genes from two different RNA replicons within the same plant cell, e.g. co- 
expressing different genes by using plant RNA virus based vectors. The achievement of 
synchronized release of two or more replicons at same time in the same cell is crucial for 
such co-expression, as the principle "first come, first served" is especially true for viral 
vectors. Systemic or cell-to-cell movement does not help, as different viral vectors do 
normally not overlap in their areas of spread or such overlap is insignificant. Simple 
calculations demonstrate the importance of the technology described in this invention for 
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achieving co-expression of two sequences of interest in the same plant cell from two 
replicons. In the case of a non-optimised viral vector with a frequency of functional 
replicon formation of only 0.2% of all cells, the proportion of cells co-expressing two genes 
from two different RNA replicons will be 0.2x0.2=0.04%, while for the construct with 
increased frequency of functional RNA replicon formation (50% or 1/2 of all cells), said 
proportion of cells will be 0.5X0.5=0.25 or 25%, e.g. about 625-fold higher. With some of 
the best performing vectors (e.g. plCH16191, Fig. 7C) the proportion of cells having the 
functional replicon reaches ca. 90% (Fig. 7C, top right). This, means that using such a 
vector for expressing two different sequences of interest from two independent replicons, 
co-expression can take place in 80% of all cells. It appears very likely that the technology 
can be further improved and that 100% co-expression can be reached. 

It is worth to note that function-conserative differences in a heterologous 
sequences of interest to be expressed from said RNA replicon might also be used to 
increase the frequency of RNA replicon formation, notably in combination with differences 
in sequences for replicon function. For example, modifications within said sequences of 
interest can be introduced that are necessary for formation and/or processing of said 
replicon. 

In another embodiment of this invention, the frequency of replicon formation was 
improved by inserting nuclear introns in said sequences for replicon function (EXAMPLE 
4). The incorporation of introns into the coding region of viral RNA-dependent RNA 
polymerase (RdRP) (EXAMPLES 4 and 8) resulted in a significant (at least 50-fold) 
increase in the frequency of replicon formation from (Fig. 7A.B) modified vectors 
(PICH15034, plCH15025, plCH15499 in Fig. 6 A,B). The RNA profile for a vector 
containing 6 inserted introns from Arabidopsis is shown in Figure 5. In another example 
(EXAMPLE 7), insertion of introns in MP sequences increases the frequency of replicon 
formation at least 1 00 times. 

Many nuclear introns can be used to practice this invention. Examples of such 
introns include but are not limited to the introns from rice tpi Acff.and salT genes 
(Rethmeier et al., 1997, Plant J., 12, 895-899; Xu et al, 1994, Plant Physiol., 100. 459- 
467; McElroy ef al., 1990, Plant Cell, 2> 163-171); from the maize Adh1. GapA1. actin and 
Bz1 genes (Callis etal., 1987. Genes Dev., 1, 1183-11200; Donath et al., 1995, Plant Mol. 
Biol., 28, 667-676; Maas et al., 1991, Plant Mol. Biol., 16, 199-207; Sinibaldi &Mettler. 
1992, in WE Cohn. K Moldave. eds. Progress in Nucleic Acids Research and Molecular 
Biology, vol 42. Academic Press. New York. pp229-257), from petunia rubisco gene 
SSU301 (Dean et al., 1989, Plant Cell, !L 201-208), Arabidopsis A1 EF1D, UBQ10, 
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UBQ3, PAT1 genes (Curie ef a/.,1993, Mol. Gen. Genet. 228, 428-436; Norris et al., 
1993, Plant Mol. Biol., 21, 895-906; Rose & Last, 1997, Plant J., 11.455-464) and many 
others. Synthetic introns can also be used for this invention. The smallest usable introns 
or their parts may be limited to splice donor and acceptor sites which usually flank the 
internal intron sequences. Preferably, the introns should have a size of at least 50 nt, 
more preferably a size of 100 to 200 nt., but actually there are no limitations regarding the 
size of the introns. However, the size of the construct should be kept suitable for 
manipulations. The origin of the intron, its structure and size may be selected individually 
depending on the nature of the vector. Transient expression experiments may be used for 
testing the efficiency of a chosen intron or the corresponding intron parts. 

The modifications described above have a cumulative effect, e.g. if intron 
insertion(s) are combined with a modification of the MP subgenomic promoter, the 
increase in frequency of replicon formation can be approx. 300-fold (EXAMPLE 5). The 
preferred regions for intron insertions in order to have an increase in the frequency of 
RNA replicon formation are called selected localities herein. Such localities may contain 
"intron-like" structures. This is confirmed by the insertion of introns in MP, actually in close 
proximity to such a problematic region as the MP subgenomic promoter (EXAMPLE 7). A 
100-fold increase in frequency of replicon formation was observed. Insertion of introns into 
"exon-like" regions does not have such a pronounced effect as insertion in said intron-like 
regions (EXAMPLE 6). 

The experiments discussed above were done with transient expression systems 
based on Agrobacterium-mediated DNA precursor delivery into plant cells. However, the 
most useful application of this invention will be for transgenic plants with a DNA precursor 
of said RNA replicon stably incorporated into a plant nuclear chromosome. This allows to 
overcome many limitations of plant viral vector-based systems, such as the restrictions to 
the maximal size of heterologous sequences viral vectors can tolerate. As the DNA 
precursor will be present in each cell of the transgenic plant, there is no absolute 
requirement for systemic movement or for cell to cell movement of the RNA replicon 
(replicon spreading). This can be compensated by the high efficiency of formation and 
transport of the RNA replicons of the invention into the cytoplasm. However, the ability of 
the vector for cell-to-cell movement can be of an additional value, as RNA replicon 
formation does not always occur in a|l cells. 



Different methods may be used for providing a plant cell with said heterologous 
DNA. Said vectors may be transformed into plant cells by a Ti-plasmid vector carried by 
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Agrobacterium (US 5.591,616; US 4.940,838; US 5,464,763) or particle or microprojectile 
bombardment (US 05100792; EP 00444882B1; EP 00434616B1). Other plant 
transformation methods can also be used like microinjection (WO 09209696; WO 
09400583A1; EP 175966B1). electroporation (EP00564595B1 ; EP00290395B1; WO 
0870661 4A1) or PEG-mediated transformation of protoplasts etc. The choice of the 
method for vector delivery may depend on the plant species to be transformed. For 
example, microprojectile bombardment is generally preferred for monocot transformation, 
while for dicots, Agrobacfer/um-mediated transformation gives better results in general. 

In the examples described below, we used Agrobacterium-mediated delivery of 
vectors (said heterologous DNA) into Nicotiana cells. However, said vectors may be 
introduced into the plants in accordance with any of the standard techniques suitable for 
stable or transient transformation of the plant species of interest. Transformation 
techniques for dicotyledonous are well known in the art and include Agrobacferium-based 
techniques and techniques which do not require Agrobacterium. Hon-Agrobacterium 
techniques involve the uptake of exogenous genetic material directly by protoplasts br 
cells. These techniques include PEG or electroporation mediated uptake, particle 
bombardment-mediated delivery and microinjection. Examples of these techniques are 
described in Paszkowski et a/., EMBO J 3, 2717-2722 (1984), Potrykus ef a/.. Mot. Gen. 
Genet. 199.169-177 (1985), Reich ef a/.. Biotechnology 4:1001-1004 (1986), and Klein et 
a/., Nature 327,70-73 (1987). In each case, the transformed cells are regenerated to 
whole plants using standard techniques. 

Agrobacter/um-mediated transformation is a preferred technique for the 
transformation of dicotyledons because of its high transformation efficiency and its broad 
utility with many different species. The many crop species which may be routinely 
transformed by Agrobacterium include tobacco, tomato, sunflower, cotton, oilseed rape, 
potato, soybean, alfalfa and poplar (EP 0 317 511 (cotton), EP 0 249 432 (tomato), WO 
87/07299 {Brassica), U.S. Patent 4,795,855 (poplar)). 

In the examples of this invention, in parallel with stable agro-transformation we 
used agro-inoculation, method of Agrobacferium-mediated delivery of T-DNAfor transient 
expression of gene(s) of interest (Vaquero et a!., 1999, Proc. Natl. Acad. Sci. USA, 96, 
11128-11133). Said agro-inoculation is an extremely useful tool not only for small-to- 
middle scale recombinant protein production systems, but as one of the elements of 
vectors optimisation system, allowing to obtain fast results with different variants of 
constructs. 



In EXAMPLE 9 of this invention, a DNA precursor of an inactivated viral 



22 

based replicon is stably incorporated into chromosomal DNA. Said replicon is optimised 
according to the invention. In addition, the replicon contains a structure preventing 
expression of the sequence of interest. Expression as well as formation of the functional 
RNA replicon can be triggered by flipping one part of the construct with the help of site- 
specific recombination. Said flipping can lead to the formation of two introns as well as to 
the assembly of a functional sequence of interest. The system described in EXAMPLE 9 
shows not only the optimisation of a viral vector but also the solution for avoiding 
"leakiness" of constructs stably integrated into chromosomal DNA, including the "leaky- 
expression of the gene of interest from said construct. In many applications, it is crucial to 
have zero level expression in the uninduced state, especially for cytotoxic proteins or for 
achieving high biosafety standards with plant expression systems for expressing technical 
or pharmaceutical proteins. 

Transcription of the heterologous DNA and/or of said recombinase can be under 
the control of an inducible or any other regulated (e.g. developmental^ regulated) 
promoter. Inducible promoters can be divided into two categories according to their 
induction conditions: those induced by abiotic factors (temperature, light, chemical 
substances) and those that can be induced by biotic factors, for example, pathogen or 
pest attack. Examples of the first category are heat-inducible (US 05187287) and cold- 
inducible (US05847102) promoters, a copper-inducible system (Mett et al., 1993, Proc. 
Natl. Acad. Sci., 90, 4567-4571 ), steroid-inducible systems (Aoyama & Chua, 1997, Plant 
J., 11. 605-612; McNellis et al., 1998, Plant J., 14, 247-257; US06063985), an ethanol- 
in'ducible system (Caddick et al., 1997, Nature Biotech., 16, 177-180; WO09321334), and 
a tetracycline-inducible system (Weinmann et al., 1994, Plant J., 5, 559-569). One of the 
latest developments in the area of chemically inducible systems for plants is a chimaeric 
promoter that can be switched on by glucocorticoid dexamethasone and switched off by 
tetracycline (Bohner et al., 1999, Plant J., 19, 87-95). For a review on chemically inducible 
systems see: Zuo & Chua, (2000, Current Opin. Biotechnol., VL 146-151) and Padidam, 
M (2003, Curr. Opin. Plant Biol., 6, 169-177). Other examples of inducible promoters are 
promoters which control the expression of patogenesis-related (PR) genes in plants. 
These promoters can be induced by treatment of a plant with salicylic acid, an important 
component of plant signaling pathways in response to pathogen attack, or other chemical 
compounds (benzo-1 ,2,3-thiadiazole or isonicotinic acid) which are capable of triggering 
PR gene expression (US05942662). 

This invention is not limited to TMV-based vectors described in examples 1 - 9, but 
can be extended to replicons based on other plant RNA viruses. The analysis of other 
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plant viral RNA sequences (EXAMPLE 10. Figures 10. 11) shows selected localrt.es very 
similar to those described for TMV and the sequences of pre-mRNA of plant nuclear 
genes (Fig. 9). This is strong evidence supporting the suggestion that, using the 
approaches described in this invention, practically any plant RNA virus-derived replicon 
can be improved fundamentally by removing/replacing problematic regions and/or 
inserting nuclear introns. 

Genes of interest, their fragments (functional or non-functional) and their artificial 
derivatives that can be expressed in plants or plants cells using the present invention 
include, but are not limited to: starch modifying enzymes (starch synthase, starch 
phosphorylation enzyme, debranching enzyme, starch branching enzyme, starch 
branching enzyme II. granule bound starch synthase), sucrose phosphate synthase, 
sucrose phosphorylase, polygalacturonase, polyfructan sucrase. ADP glucose 
pyrophosphorylase. cyclodextrin glycosyltransferase. fructosyl transferase, glycogen 
synthase, pectin esterase, aprotinin. avidin. bacterial levansucrase. E.coli glgA protein. 
MAPK4 and orthologues, nitrogen assimilation/methabolism enzyme, glutamine synthase, 
plant osmotin. 2S albumin, thaumatin, site-specific recombinase/integrase (FLP, Cre. R 
recombinase, Int. SSVI Integrase R. Integrase phiC31, or an active fragment or variant 
thereof), oil modifying enzymes (like fatty acids desaturases. elongases etc), isopentenyl 
transferase. Sea M5 (soybean calmodulin), coleopteran type toxin or an insecticidally 
active fragment, ubiquitin conjugating enzyme (E2) fusion proteins, enzymes that 
metabolise lipids, amino acids, sugars, nucleic acids and polysaccharides, superoxide 
dismutase. inactive proenzyme form of a protease, plant protein toxins, traits altering fiber 
in fiber producing plants, Coleopteran active toxin from Bacillus thuringiensis (Bt2 toxin, 
insecticidal crystal protein (ICP), CrylC toxin, delta endotoxin, polyopeptide toxin, protoxin 
etc.). insect specific toxin AalT, cellulose degrading enzymes. E1 cellulase from 
Acidothermus celluloticus, lignin modifying enzymes, cinnamoyl alcohol dehydrogenase, 
trehalose-6-phosphate synthase, enzymes of cytokinin metabolic pathway, HMG-CoA 
reductase. E. co// inorganic pyrophosphatase, seed storage protein, Erwinia herbicola 
lycopen synthase. ACC oxidase. pTOM36 encoded protein, phytase. ketohydrolase. 
acetoacetyl CoA reductase, PHB (polyhydroxybutanoate) synthase, enzymes involved in 
the synthesis of polyhydroxylalkanoates (PHA), acyl carrier protein, napin. EA9. non- 
higher plant phytoene synthase. pTOM5 encoded protein, ETR (ethylene receptor), 
plastidic pyruvate phosphate dikinase. nematode^inducible transmembrane pore protein, 
trait enhancing photosynthetic or plastid function of the plant cell, stilbene synthase, an 
enzyme capable of hydroxylating phenols, catechol dioxygenase. catechol 2,3- 
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dioxygenase, chloromuconate cycloisomerase, anthranilate synthase, Brassica AGL15 
protein, fructose 1 ,6-biphosphatase (FBPase), AMV RNA3, PVY replicase, PLRV 
replicase. potyvirus coat protein, CMV coat protein, TMV coat protein, luteovrus 
replicase, MDMV messenger RNA; mutant geminiviral replicase. Umbellularia calrforruca 
C12:0 preferring acyl-ACP thioesterase, plant C10 or C12:0 preferring acyl-ACP 
thioesterase, C14:0 preferring acyl-ACP thioesterase (luxD), plant synthase factor A, plant 
synthase factor B, D6-desaturase, proteins having an enzymatic activity in fatty ac.ds 
biosynthesis and modifications, e.g. the peroxysomal p-oxidation of fatty acids in plant 
cells, acyl-CoA oxidase, 3-ketoacyl-CoA thiolase, lipase, maize acetyl-CoA-carboxylase, 
etc- 5-enolpyruvylshikimate-3-phosphate synthase (EPSP), phosphinothricin acetyl 
transferase (BAR. PAT), CP4 protein, ACC deaminase, protein having posttranslational 
cleavage site. DHPS gene conferring sulfonamide resistance, bacterial nitrilase, 2,4-D 
monooxygenase, acetolactate synthase or acetohydroxyacid synthase (ALS, AHAS), 
polygalacturonase, Taq polymerase, bacterial nitrilase. many other enzymes of bactenal 
or phage origin including restriction endonucleases. methylases. DNA and RNA hgases. 
DNA and RNA polymerases, reverse trascryptases. nucleases (Dnases and RNAses), 
phosphatases, transferases etc. 

The present invention can be used for the purpose of molecular farming and 
purification of commercially valuable and pharmaceutical^ important proteins including 
industrial enzymes (cellulases. lipases, proteases, phytases etc.) and fibrous prote.ns 
(collagen, spider silk protein, etc.). Human or animal health protein may be expressed and 
purified using described in our invention approach. Examples of such proteins of .nterest 
include inter alia immune response proteins (monoclonal antibodies, single cha.n 
antibodies, T cell receptors etc.). antigens including those derived from pathogen.c 
microorganisms, colony stimulating factors, relaxins. polypeptide hormones includ.ng 
somatotropin (HGH) and proinsulin, cytokines and their receptors, interferons, growth 
factors and coagulation factors, enzymatically active lysosomal enzyme, fibnnolytic 
polypeptides, blood clotting factors, trypsinogen. a 1 -antitrypsin (AAT), human serum 
albumin, glucocerebrosidases, native cholera toxin B as well as function-conservative 
proteins like fusions, mutant versions and synthetic derivatives of the above prote.ns. 
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EXAMPLES 

The following examples are presented to illustrate the present invention. Modificatoons and 
variations may be made without departing from the spirit and scope of the invention. 



EXAMPLE 1 

Construction of a TMV-based RNA vector: 



Cloned cDNAs of the crucifer-infecting tobamovirus (cr-TMV; Dorokhov et al., 
1994, FEBS Lett. 350, 5-8) and of the turnip vein-clearing virus (TVCV; Lartey et al., 1994, 
Arch. Virol. 138, 287-298) were obtained from Prof. Atabekov from Moscow University, 
Russia. A viral vector containing a green fluorescence protein (GFP) gene was made in 
several cloning steps. The resulting construct, plCH8543 (Fig. 6A), contains in sequential 
order, a 787 bp fragment from the Arabidopsis actin 2 promoter {ACT2, ref An et al, 1996, 
GenBank accession AB026654, bp 57962 to 58748), the 5" end of TVCV (GenBank 
accession BRU03387, bp 1 to 5455), a fragment of cr-TMV (GenBank accession Z29370, 
bp 5457 to 5677, with thymine 5606 changed to cytosine to remove the start codon of the 
coat protein, CP), sequences "taa teg ata. act cga g", a synthetic GFP (sGFP) gene, cr- 
TMV 3' nontranslated region (3' NTR; GenBank accession Z29370, bp 6078 to 6312), 
and finally the nopaline synthase (Nos) terminator. The entire fragment was cloned 
between the T-DNA left (LB) and right (RB) borders of pICBVIO, a Carb R pBIN19-derived 
binary vector. plCH8543 was transformed into Agrobacterium strain GV3101 and 
infiltrated into a Nicotiana benthamiana leaf. Foci of GFP fluoresence that appeared at 3 
dpi grew and became confluent. Surprisingly, even though most cells in the infiltrated area 
finally expressed GFP due to viral replication and movement, only a fraction of the cells 
initiated viral replication, as detected by a number of independent GFP expressing foci. It 
becamce clear that the limiting factor is not DNA delivery to plant cells, since infiltration of 
Nicotiana benthamiana leaves with a GFP gene under control of the 35S promoter leads 
to GFP expression in almost every cell in the infiltrated area (not shown). 

To confirm this observation, we made a viral vector construct containing a 
mutation in the MP. This construct, called plCH14833. is similar to plCH8543 but differs 
by a deletion of 389 bp in the MP gene, upstream of the EcoRI site present in the MP. The 
sequence of the Ncol to EcoRI fragment that includes this deletion is given in the annex 
as SEQ ID No. 1. The entire viral construct (from the ACT2 promoter to the Nos 
terminator) was cloned between the T-DNA left and right borders of plCBV49, a P BIN19- 
derived Kan R binary vector. Due to the deletion in the MP, replicons produced from this 
construct cannot move from cell to cell but are able to replicate autonomously within a 



26 

cell. Cell to cell movement can be restored when MP is provided in trans, e.g. from a 
constitutive promoter such as the cauliflower mosaic virus 35S promoter. 

To make an MP expression construct, the TVCV MP gene was amplified by PCR 
from cloned TVCV cDNA (GenBank accession Z29370, bp 4802 to 5628) and subcloned 
in a binary vector under control of the 35S promoter. The resulting construct, called 
plCH10745 (not shown), and plCH14833 were transformed into Agrobacterium strain 
GV3101 and were infiltrated in Nicotiana benthamiana leaves. Infiltration of plCH14833 
alone led to the appearance of a few GFP expressing cells within the infiltrated area. By 
counting protoplasts prepared from the infiltrated area, we found that only one to three 
protoplasts expressed GFP from a total of 500 protoplasts (0.2 to 0.6%). Coinfiltration of 
plCH14833 and plCH10745 led to the formation of GFP-expressing foci that grew from 
each initial GFP-expressing cell. Ultimately, due to cell-to-cell movement, a large 
proportion of cells in the infiltrated area expressed GFP (Fig. 7A). 

RNA viruses such as tobamoviruses replicate in the cytoplasm and never enter the 
nucleus. Therefore, they have evolved in an environment where they are not exposed to 
the nuclear pre-mRNA processing machinery. As a result, it is not surprising that RNA 
replicon transcripts generated in the nucleus from artificial viral constructs may not be 
recognized and processed properly by the RNA processing machinery. Moreover, RNA 
replicons from viral vectors are very large: approximately 7,000 nt in the case of the 
replicon based on TMV. Very few plant genes have such a large size and the majority of 
such genes contains introns that facilitate processing of the pre-mRNAs, export from the 
nucleus, and that improve the stability of the processed transcripts. We therefore 
hypothesized that modifications of the pre-mRNAs that would increase the efficiency of 
accurate processing and of export of correctly processed transcripts from the nucleus to 
the cytosol would lead to an increase of the number of cells that would initiate viral 
replication. It turned out that there are two approaches can be used to make RNA virus- 
based vectors that can more efficiently initiate viral replication after DNA delivery to the 
nucleus: (1) one approach is the removal of sequence features that might induce 
unwanted processing events (such as alternative splicing events using cryptic splice sites, 
or premature termination events); (2) a second approach is the addition of introns to 
increase the amount of properly processed transcripts, to improve export of the RNA from 
the nucleus to the cytoplasm, and/or to improve stability of the transcripts. 
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EXAMPLE 2 

Removal of intron-like sequences increases the frequency of viral RNA replicon formation 
in the cytoplasm 

We analyzed the sequence of the RNA replicon from plCH4351 using the 
Netgenell server program (http://www.cbs.dtu.dk/services/NetGene2/) and noticed several 
intron-like sequence features that might induce alternative splicing events. One such 
feature is a 0.6 kb uridine-rich region (corresponding to nt 827 to 1462 in GenBank 
accession BRU03387) at the beginning of the RdRP (Fig. 2A). This region was replaced in 
plCH14833 by a PCR-mutagenized sequence that differs from the original sequence by a 
54 nucleotide substitution (sequence given in annex as SEQ ID No. 2, Fig. 3). The 52 
nucleotide substitutions were made to replace T-rich sequences by more GC-rich 
sequences. Ail nucleotide substitutions were made silent so as not to change the RdRP 
protein sequence. This mutagenized fragment also contains two nucleotide substitutions 
(at position 829 and 1459; coordinates relative to GenBank accession BRU03387) that 
were introduced to remove putative cryptic splice donor and acceptor sites, respectively. 
To test the effect of these mutations, the resulting clone plCH15466 (Fig. 6A) was 
agroinfiltrated in N. benthamiana leaves with or without plCH10745 (movement protein in 
trans). Eight days after infiltration, a 10-fold increase in the number of GFP expressing 
cells was observed in the area infiltrated with plCH15466 (compared to plCH14833, Fig. 
7). This suggests that removal of intron-like sequences from the viral amplicon prevents 
unwanted alternative splicing events and results in more efficient initiation of viral 
replication. Coinfiltration of plCH15466 and plCH10745 leads to cell-to-cell movement of 
the modified replicon at a similar speed as a non-modified replicon. This shows that the 
modification of the RNA sequence did not affect cell to cell movement of the viral vector. 

EXAMPLE 3 

Removal of intron-like sequences in the MP subgenomic promoter 

A second potentially problematic region corresponds to the MP subgenomic 
promoter (Fig. 2B). This region is very T-rich and resembles intron sequences very 
closely. As a consequence, many cryptic splice donor and acceptor sites are predicted in 
nearby sequences by intron prediction programs. Unfortunately, modifications cannot be 
made easily to this region without affecting subgenomic promoter function. We decided to 
completely mutagenize the entire region without regard for the subgenomic promoter, and 
to provide MP in trans to compensate for the expected loss of MP expression. As MP will 
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not be expressed from this construct, we also deleted most of MP sequence except for the 
3' sequences that contain the CP subgenomic promoter which is required to drive 
expression of the gene of interest. We therefore replaced a 383 bp fragment in 
PICH14833 (bp 4584 to 5455 in GenBank accession BRU03387) by a 297 bp 
mutagenized fragment. The resulting construct plCH15900 (Fig. 6A) was agroinfiltrated in 
Nicotiana benthamiana leaves with or without plCH10745. Interestingly, a huge increase 
in the number of cells initiating replication was detected in comparison to leaf areas 
infiltrated by plCH14833. By counting GFP-expressing protoplasts prepared from 
infiltrated leaf areas, we estimate that this modification results in a 80 to 100-fold increase 
in the number of cells initiating viral replication compared to the unmodified pICH 14833. 
PICH15900 was coinfiltrated with plCH10745 (p35S-MP expression cassette) and an 
increase in GFP fluorescence was detected due to cell to cell movement. This increase 
was however very limited because so many cells already expressed GFP even in the 
absence of cell-to-cell movement. A 1000 fold dilution of the agrobacterium suspension 
containing plCH15900 coinfiltrated with an undiluted suspsension of agrobacteria 
containing plCH10745 gave rise to separate GFP expression foci. Fluorescent foci were 
as bright and of the same size as control foci obtained with plCH14833. This tells us that 
the modification in plCH15900 and the delivery of MP in trans do not compromize 
functionality of the replicon regarding the level of replication, expression of the gene of 
interest and cell-to-ell movement. The same constructs (plCH14933 and plCH15900, with 
or without plCH10745) were coinfiltrated to Nicotiana tabacum leaves. The modifications 
in plCH15900 lead to a similar increase in the number of cells initiating replication (in 
comparison to plCH14833) as they did in N. benthamiana. 

EXAMPLE 4 

Addition ofintrons improves the frequency of formation of functional RNA replicons in the 
cytoplasm 

We then tested whether the addition of introns into viral pro-replicon sequences 
would increase the frequency of initiation of replication. Two constructs were made, 
PICH15025 and plCH15034 (Fig. 6A), each containing three different Arabidopsis thaliana 
introns in two different regions of the RdRP. plCH15025 was designed to contain introns 
in the middle of the RdRP, while plCH15034 contains introns in the 3' end of the RdRP, 
upstream of the MP subgenomic prompter. The introns were amplified by PCR from 
arabidopsis genomic DNA and incorporated into viral sequences using PCR with primers 
overlapping the planned intron/exon junctions. The fragments containing the introns were 
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subcloned into plCH14833 as an Aval Hindlll fragment (SEQ ID No. 4 in the annex) to 
make plCH15025 or as a Pst1 Ncol fragment (SEQ ID No. 5 in the annexe) to make 
plCH15034. 

Both constructs were agroinfiltrated into N. benthamiana leaves and compared to 
PICH14833. Both constructs significantly increased the number of cells initiating viral 
replication (Fig. 7A). This increase was estimated to be on the order of a 50-fold 
improvement relative to plCH14833. Both constructs were also coinfiltrated with a MP 
expressing clone, and cell-to-cell movement was found to be identical as for clones 
without introns. Both constructs were also tested in N. tabacum, and the same 

improvement was observed as in N. benthamiana (Fig. 7B). 

A third clone was made, plCH15499, which contained all 6 introns (Fig. 5, 6B, 7A, 

7B) This construct was tested in N. benthamiana and N. tabacum. This construct was 

more efficient than each individual construct with 3 introns, but the improvement was 

however less than additive. 



EXAMPLE 5 

Addition of introns and removal of intron-like sequences increases the frequency of the 
formation of functional RNA replicons in the cytoplasm 

Removing intron-like features and adding additional introns in one construct 
showed that both types of modifications can contribute to improve initiation of v.ral 
replication. We subcloned the 6 introns of plCH15499 into P ICH15900, which contains the 
mutagenized MP subgenomic promoter region. The resulting clone plCH15860 (Fig. 6B) 
was infiltrated into ML benthamiana leaves and found to work significantly better than 
either parental clones within the range of approximately 50% to 90% of all protoplasts 
expressing GFP (Fig. 7). The best performing construct contains introns within the RdRP 
region and modified MP subgenomic promoter region (plCH16191, Fig. 7C). In 
comparison to a clone without any modification, this represents an 80- to 300-fold 
improvement. This construct was also coinfiltrated with a MP-expressing construct 
(PICH10745) and it was found that the modifications did not compromize cell-to-cell 
movement or replication. 



EXAMPLE 6 

Not all intron additions increase the frequency of appearance of functional RNA replicons 
in the cytoplasm 
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We inserted two different Arabidopsis introns at the beginning of the RdRP. 
resulting in clone plCH15477 (the sequence of this region is shown as SEQ ID No. 7 in 
the annex). The sequence in this region already looks very "exon-like" (e.g. GC-rich 
without cryptic splice sites) before the addition of introns. No improvement on replication 
of viral initiation was seen with this construct. Therefore, not any addition of an intron will 
result in an improvement of the viral vector. It appears that the position chosen for intron 
insertion or mutagenesis is an important parameter. For example, all intron insertions or 
nucleotide substitutions that were made in regions near problematic structures such as 
the MP subgenomic promoter, resulted in large improvements, while insertions of introns 
into sequences that are already "exon-like" did not improve initiation of viral replication in 
a significant way. 

EXAMPLE 7 

Insertion of introns in MP sequences increase the frequency of viral replicon formation 

We first made a frameshift in the MP by digestion with the restriction enzyme Avrll, 
filling and religation. We then inserted two introns in the MP. The resulting clone 
plCH16422 (Fig. 6B) was infiltrated in Nicotiana benthamiana leaves. An about 100-fold 
increase in the number of cells containing the functional viral replicon was detected. 

EXAMPLE 8 

Insertion of introns into a MP containing vector improves the frequency of initiation of viral 
replication of autonomous functional clones 

A Kpn1 EcoRI fragment was subcloned from plCH15499 into plCH8543. The 
resulting clone, 16700 (Fig. 6B) contained a complete viral vector with 6 introns in the 
RdRP. This clone was infiltrated in N. benthamiana leaf and efficiently initiated replication. 
This clone was also able to move cell to cell without the need to provide additional MP in 
trans. 

EXAMPLE 9 

Activation of an inactive replicon stably integrated on a chromosome 

It is also possible to stably transform intron-containing viral vector constructs in 
transgenic plants. To avoid deleterious viral replication that would inhibit plant growth, an 
inactive clone can be made by having part of the vector present in antisense orientation 



(Fig 8) Incorporation of recombination sites and of introns sequences at the extremrt.es 
of the inverted fragment allow for this fragment to be 'flipped' in the correct orientation by 
using an appropriate recombinase. Recombination sites will be completely eliminated from 
the replicon by splicing. Introns in the pro-replicon allow efficient initiation of replication 
after recombination and transcription. In one specific example, the recombination srtes are 
located within the gene of interest and downstream of the pro-replicon. Such configurate 
prevents any gene expression before recombination. Other configurations can be 
considered where the recombination srtes are located in other areas of the pro-rephcon 
such as the RdRP and upstream of the promoter. Intron sequences at the recomb.nat.on 
site have the advantage of allowing to completely remove the recombination site from the 
replicon, but also increases the efficiency of viral replication, as described before. 



EXAMPLE 10 

Plant viral RNA sequences contain potentially unstable regions 

The analysis of RNA profile of selected plant RNA viruses as well as one well 
characterised plant gene (AtDMCI) was performed by using the Netgenell sender program 
(http //www.cb S .dtu.dk/services/NetGene2/). The RNA profile shown in Fig. 9 for AtDMCI 
clearly reflects the presence of 14 introns (circled), previously identified by comparing the 
cDNA and genomic DNA sequences. It is. evident that RNA profiles of two plant viruses 
have regions (see the Figures 10, 11) which might cause problems for the stability of sa.d 
RNA if they are placed in plant nuclear environment. We have analysed the RNA profiles 
of several other representatives of plant RNA viruses (not shown), such as Brome mosa.c 
Virus different strains of TMV, and many others. All of them have potential problematc 
regions that might compromise the efficiency of plant RNA virus-based replicon format,on 
if delivered into the plant cell as DNA precursors. 



ANNEX 

SEQ ID No. 1 (Ncol-EcoRI fragment of plCH14833): 
ccatggac^aagtgataaaggcagctt^^ 
gcgggcgcgaacctcatgtggaacttcgagg^ 
atgatagaggagccattgtgtattac^atccgct^^ 

aagagttacgcgagtcmgtgtgatgtagctagtaacttaaataattgtgcgtattmcacagttagatgaggccgttgccgag 
tcataagaccgcggtaggcggttcgwg^^ 

gataatgtcgatagtctcgtacgaacctaaggtgagtgamcctcaatcWcgaagaaggaagagatcttgccgaag^ 
aacgaggttagaattc 
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SEQ ID No. 2 (part of plCH1 5466): 

ggagataacctgagcttcttcttccataatgagagcactctcaattacacccacagcttcagcaacatcatcaagtacgtgtgc 

aagacgttcttccctgctagtcaacgcttcgtgtaccacaaggagttcctggtcactagagtcaacacttggtactgcaagttca 

cgagagtggatacgttcactctgttccgtggtgtgtaccacaacaatgtggattgcgaagagttttacaaggctatggacgatg 

cgtggcactacaaaaagacgttagcaatgcttaatgccgagaggaccatcttcaaggataacgctgcgttaaacttttggttcc 

cgaaagtgagagacatggttatcgtccctctctttgacgcttctatcacaactggtaggatgtctaggagagaggttatggtgaa 

caaggacttcgtctacacggtcctaaatcacatcaagacctatcaagctaaggcactgacgtacgcaaacgtgctgagcttc 

gtggagtctattaggtctagagtcataattaacggtgtcactgccaggtctgaatgggacacagacaaggcaattctaggtcc 

attagcaatgacattcttcctgatcacgaagctgggtcatgtgcaagat 
SEQ ID No. 3 (part of pICH 15900): 

gcggacgatacgtgatccaccatgatagaggagccattgtgtattacgatccgcttaaactaatatctaagctcggctgcaag 
cacatcagagacgtcgtgcacttagaagagttacgcgagtctttgtgcgacgtagctagtaacttgaacaactgcgcctacttc 
tcacagttagatgaggccgttgctgaggtccacaagactgcggtcggaggctccttcgcgttctgtagcatcatcaaatacttgt 

cagacaagaggctgttcagggacctgttcttcgtctgagttgacg 
SEQ ID No. 4 (part of pICH 15025): 

cccgagctatactgtaccttcgccgaccgattggtactacagtacaagaaggcggaggagttccaatcgtgtgatctttccaaa 
cctctagaagagtcagagaagtactacaacgcattatccgagctatcagtgcttgagaatctcgactcttttgacttagaggcgt 
ttaagactttatgtcagcagaagaatgtggacccggatatggcagcaaaggtaaatcctggtccacacttttacgataaaaac 

acaagattttaaactatgaactgatraataatcattcc^ 

ggtggtcgtagcaatcatgaagtcagaattgacgttgcctttcaagaaacctacagaagaggaaatctcggagtcgctaaaa 

ccaggagaggggtcgtgtgcagagcataaggaagtgttgagcttacaaaatgatgctccgttcccgtgtgtgaaaaatctagt 

tgaaggttccgtgccggcgtatggaatgtgtcctaagggtggtggtttcgacaaattggatgtggacattgctgatttccatctca 

agagtgtagatgcagttaaaaagggaactatgatgtctgcggtgtacacagggtctatcaaagttcaacaaatgaagaacta 

catagattacttaagtgcgtcgctggcagctacagtctcaaacctctgcaaggtaagaggtcaaaaggtttccgcaatgatcc 

ctctttttttgtttctctagtttraagaattgggtata^ 

tatgttttaggtgcttagagatgttcacggcgttgacccagagtcacaggagaaatctggagtgtgggatgttaggagaggac 

gttggttacttaaacctaatgcgaaaagtcacgcgtggggtgtggcagaagacgccaaccacaagttggttattgtgttactca 

actgggatgacggaaagccggtttgtgatgagacatggttcagggtggcggtgtcaagcgattccttgatatattcggatatgg 

gaaaacttaagacgctcacgtcttgcagtccaaatggtgagccaccggagcctaacgccaaagtaattttggtcgatggtgtt 

c^cggttgtggaaaaac^aaggagattatcgaaaaggtaagttctgcatttggttatgctccttgcattttaggtgttcgtcgctctt 

ccatttocatgaatagctaagatttttttl^ 
ggtaaacttctctgaagacttgatttlagtccctgggaaggaagctt 
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SEQ ID No. 5 (part of plCH15034): 

ctgcaggtaaaatattggatgccagacgatattcmct^ 

aggtc^ataattttttWggagccatt^ 

tgcaatgatcaaaagaaacatgaatgcgccggamgacagggacaattgacattgaggatactgcatctctggtggttgaaa 
agttttgggattcgtatgttgacaaggaatttagtggaacgaacgaaatgaccatgacaagggagagcttctccaggtaagg 

acttctcatgaatattagtggcagattagtgttgtt^^^^ 

acaattaaaggtggctttcgaaacaagagtcatctacagttggtcagttagcggactttaactttgtggatttgccggcagtagat 

gagtacaagcatatgatcaagagtcaaccaaagcaaaagttagacttgagtattcaagacgaatatcctgcattgcagacg 

atagtctaccattcgaaaaagatcaatgcgatmcggtccaatgttttcagaacttacgaggatgttactcgaaaggattgactc 

ttcgaagtttctgttctacaccagaaagacacctgcacaaataga^ 

aattctggaactcgacatttcgaagtacgataagtcacaaaacgagttccattgtgctgtagagtacaagatctgggaaaagtt 
aggaattgatgagtggctagctgagtfctggaaacaaggtgagtt^^ 

cagatcaacatcaaaattaggttcaaMcatcaac<^aataatattWcatgtatatataggtcacagaaaaa(^ 

agattatacggccggaatraaaacatgtctttggtat^^ 
attgccgcatgtttgagctcaatgatccccatgg 

SEQ ID No. 6 (fragment of plCH1 5477): 

gttttagttttattg^acaa^acaaraaattacaataacaacaaacaaaatacaaacaacaacaacatggcacaamra 

acaaacaattgacatgcaaactctccaagccgctgcgggacg^acagcttggtgaatgaWggcatctcgtcgcgttta^ 

ataatgcagtc^aggagctgaatgctcgttccagacgtcccaaggtaaaacaacatttcattcacatatatg 

tgagtacgaagaagacacttactacttgttgatgaaagtttccgcctttatacttatctatatc^tWcatc^^ 

aattaggtgatgtttatatgatatcatggaacattaatctatagggaaactgtWgagttagtWgtataata^ 

aggttcamctccaaggc^gtgtctacggaa^ 

tacgcaatccgctgtgcactccttggccggaggccttcggtcacttgagttggagtatct^tgatgcaagttcc^ 

gaccta^acatcggcggaaacttctccgcgcacctcttcaaaggtaattttcWctctactcaa^ 
gaagactgatctatagttaaaattaatctctactajattcttg^acctcaggtcgcgattacgttcactgctgratgc 
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Claims 



A stably transformed plant, plant part, or plant cell culture containing in cell nuclei 
a heterologous DNA having a sequence encoding an RNA replicon operably linked 
or linkable to a transcription promoter, 
wherein said sequence encoding an RNA replicon contains 

(i) sequences for replicon function of said RNA replicon, said sequences 
being derived from a sequence of a plant RNA virus, and 

(ii) a sequence of interest to be expressed from said RNA replicon, 
whereby said sequences for replicon function exhibit at selected localities of said 
sequence of said plant RNA virus function-conservative differences from said 
sequence of said plant RNA virus, said differences causing an increased 
frequency of replicon formation compared to an RNA replicon not exhibiting said 
differences. 

The plant, plant part, or plant cell culture according to claim 1, wherein said 
function-conservative differences comprise a reduction of the deleterious effect in 
cell nuclei of a high A/U content in a transcript of said heterologous DNA on RNA 
replicon formation. 

The plant, plant part, or plant cell culture according to claim 1 or 2, wherein said 
selected localities of said sequences of said plant RNA virus are localities of high 
A/U content. 

The plant, plant part, or plant cell culture according to any one of claims 1 to 3, 
wherein said function-conservative differences comprise a reduction of a high A/U 
content in sequences for replicon function of said RNA replicon. 

The plant, plant part, or plant cell culture according to claim 4, wherein said high 
A/U content is reduced by at least partial deletion or at least partial replacement by 
G/C bases. 

The plant, plant part, or plant cell culture according to any one of claims 1 to 5, 
wherein said function-conservative differences comprise removal of cryptic splicing 
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sites flanking A/U-rich regions. 



The plant, plant part, or plant cell culture according to any one of claims 1 to 6, 
wherein said function-conservative differences comprise the insertion of one or 
more nuclear introns or one or more sequences capable of forming nuclear introns 
near or within A/U-rich localities of said sequences for replicon function. 

The plant, plant part, or plant cell culture according to claim 7, wherein said 
sequence(s) capable of forming a nuclear intron are capable of forming a nuclear 
intron by site-specific recombination. 

The plant, plant part, or plant cell culture according to any one of claims 1 to 8, 
wherein said sequence encoding an RNA replicon has one or more segments that 
code together for said RNA replicon, whereby formation of said RNA replicon 
requires rearrangement of said segments. 

The plant, plant part, or plant cell culture according to any one of claims 1 to 9, 
wherein all cell nuclei contain said heterologous DNA. 

The plant, plant part, or plant cell culture according to any one of claims 1 to 10, 
wherein said heterologous DNA is contained in a chromosome of said cell nuclei. 

The plant, plant part, or plant cell culture according to claim 11, wherein all cells of 
said plant, plant part, or plant cell culture contain said heterologous DNA in a 
nuclear chromosome. 

The plant, plant part, or plant cell culture according to one of claims 1 to 12, 
wherein said sequence encoding an RNA replicon further contains at localities of 
high A/T(U) content in said sequence of interest function-conservative differences. 

The plant, plant part, or plant cell culture according to any one of claims 1 to 13, 
wherein said plant RNA virus is a tobamovirus, preferably a tobacco mosaic virus. 



The plant, plant part, or plant cell culture according to any one of claims 
wherein said function-conservative differences are function-silent. 
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The plant part according to any one of claims 1 to 15, wherein said plant part is a 
seed. 

A process of expressing a sequence of interest in a plant, plant part, or plant cell 
culture, comprising: 

(a) providing a plant, plant part, or plant cell culture containing in cell nuclei a 
heterologous DNA having a sequence encoding an RNA replicon operably 
linked or linkable to a transcription promoter, 
wherein said sequence encoding an RNA replicon contains 

(i) sequences for replicon function of said RNA replicon, said 
sequences being derived from a sequence of a plant RNA virus, 

(ii) a sequence of interest, 

whereby said sequences for replicon function exhibit at selected localities 
of said sequences of said plant RNA virus function-conservative 
differences from said sequence of said plant RNA virus, said differences 
causing an increased frequency of replicon formation compared to an RNA 
replicon not exhibiting said differences; and 
(b) causing expression of said sequence of interest. 

The process according to claim 17, wherein said heterologous DNA is stably 
incorporated in a nuclear chromosome. 

The process according to claim 17 or 18, wherein said sequence of interest 
encodes a protein of interest and step (b) comprises causing expression of said 
protein of interest 

The process according to claim 19, wherein said expression of said protein leads 
to production of a desired product in said plant, plant part or plant cells. 

The process according to one of claims 17 to 20, wherein said sequence encoding 
an RNA replicon has two or more segments that code together for said RNA 
replicon and step (b) comprises rearranging said segments such that said RNA 
replicon can be formed. 

The process according to claim 21, wherein one of said segments is flanked by 
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recombination sites and said rearranging comprises site-specific recombination by 
a recombinase. 

The process according to one of claims 21 to 22, wherein step (b) comprises 
providing a site-specific recombinase to said plant, plant part, or plant cell culture. 

The process according to claim 23, wherein said site-specific recombinase is 
provided by transient transfection with a vector encoding said recombinase. 

The process according to claim 23, wherein said site-specific recombinase is 
stably encoded in a nuclear chromosome under the control of a regulated 
promoter and step (b) comprises inducing said regulated promoter. 

The process according to claim 22 to 25, wherein said recombinase is selected 
from the group consisting of CRE, flippase, resolvase, FLP, SSV1-encoded 
integrase, R recombinase, P hiC31 integrase, Inti integrase, phi 80, P22, P2, 186, 
and P4 recombinase. Tn3 resolvase, the Hin recombinase, the Cin recombinase. 
E. co// xerC and xerD recombinases. Bacillus thuringiensis recombinase. 

The process according to any one of claims 22 to 26, wherein said recombinase is 
formed within plant cells from non-functional fragments of said recombinase by 
affinity interaction. 

The process according to any one of claims 22 to 26, wherein said recombinase is 
formed within plant cells from non-functional fragments of said recombinase by 
intein-mediated protein trans-splicing. 

The process according to one of claims 17 to 21, wherein said heterologous DNA 
is linked to a regulated transcription promoter. 

The process according to claim 29, wherein said regulated promoter is a 
chemically inducible promoter. 

The process according to to any one of claims 17 to 30, wherein said providing a 
plant, plant part, or plant cell culture of step (a) comprises transformation of a 
plant, plant part, or plant cells with a nucleic acid molecule containing said 



heterologous DNA. 
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The process according to any one of claims 17 to 30, wherein said providing a 
plant, plant part, or plant cell culture of step (a) comprises transient transformation 
of a plant, plant part, or cells of a plant cell culture with a nucleic acid molecule 
containing said heterologous DNA. 

The process according to any one of claims 17 to 32, wherein said plant, plant part 
or plant cell culture contains two or more different heterologous DMAs and said 
process comprises formation of two different RNA replicons. 

The process according to any one of claims 17 to 33, wherein said RNA 
replicon is capable of exhibiting an increased frequency of replicon formation in a 
plant, plant part or plant cell culture other than the natural host plant of the RNA 
virus from which said RNA replicon is derived. 

Nucleic acid molecule containing a DNA sequence encoding an RNA replicon 
operably linked or linkable to a transcription promoter, 
wherein said sequence encoding an RNA replicon contains 

(i) sequences for replicon function of said RNA replicon, said sequences 
being derived from a sequence of a plant RNA virus, 

(ii) a sequence of interest to be expressed from said RNA replicon, 
whereby said sequences for replicon function correspond to sequences of said 
plant RNA virus and exhibit at selected localities of said sequences of said plant 
RNA virus function-conservative differences from said sequence of said plant RNA 
virus, said differences being capable of causing an increased frequency of replicon 
formation compared to an RNA replicon not exhibiting said differences, when said 
nucleic acid molecule in introduced in plant cells or a plant. 
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Abstract 



A stably transformed plant, plant part, or plant cell culture containing in cell nuclei a 
heterologous DNA having a sequence encoding an RNA replicon operably linked or 
linkable to a transcription promoter, 

wherein said sequence encoding an RNA replicon contains 

(j) sequences for replicon function of said RNA replicon, said sequences being 
derived from a sequence of a plant RNA virus, and 

(ii) a sequence of interest to be expressed from said RNA replicon, 

whereby said sequences for replicon function exhibit at selected localities of said 
sequence of said plant RNA virus function-conservative differences from said 
sequence of said plant RNA virus, said differences causing an increased 
frequency of replicon formation compared to an RNA replicon not exhibiting said 
differences. 
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